Automatic speech recognition using acoustic confidence conditioned language models
نویسندگان
چکیده
A modi ed decoding algorithm for automatic speech recognition (ASR) will be described which facilitates a closer coupling between the acoustic and language modeling components of a speech recognition system. This closer coupling is obtained by extracting word level measures of acoustic con dence during decoding, and making coded representations of these con dence measures available to the ASR network during decoding. A simulation of this decoding strategy is implemented using a word lattice rescoring paradigm. A joint acoustic{language model will be described where linguistic context is augmented to include the encoded values of acoustic con dence. Finally, the performance of the word lattice based implementation of the decoding algorithm will be evaluated on a large vocabulary natural language understanding task.
منابع مشابه
Allophone-based acoustic modeling for Persian phoneme recognition
Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...
متن کاملConfidence Measures in HMM/MLP Hybrid Speech Recognition for Turkish Language
A confidence measure is defined as the posterior probability of word correctness given the values of confidence indicators [8]. Confidence measures can be calculated from a posteriori probability of recognized word or sub-word unit inferred from some acoustic models and language models, including various normalization techniques. In this work we present several confidence measures and propose t...
متن کاملComparison of effects of acoustic and language knowledge on spontaneous speech perception/recognition between human and automatic speech recognizer
An automatic speech recognizer uses acoustic knowledge and linguistic knowledge. In large vocabulary speech recognition, acoustic knowledge is modeled by hidden Markov models (HMM), linguistic knowledge is modeled by N-gram (typically bi-gram or trigram), and these models are stochastically integrated. It is thought that humans also integrate acoustic and linguistic knowledge of speech when per...
متن کاملCroatian Large Vocabulary Automatic Speech Recognition
This paper presents procedures used for development of a Croatian large vocabulary automatic speech recognition system (LVASR). The proposed acoustic model is based on context-dependent triphone hidden Markov models and Croatian phonetic rules. Different acoustic and language models, developed using a large collection of Croatian speech, are discussed and compared. The paper proposes the best f...
متن کاملAsr for Automatic Directory Assistance: the Smada Project
In this paper we summarise the state-of-the-art for automatic speech recognition in automated Directory Assistance at the start of the 5th Framework project SMADA. Details are given about robust acoustic features for use in Distributed Speech Recognition, especially with respect to noise suppression. Then an overview is given of the confidence measures which are in use today, and their similari...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999